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Abstract. We compare the solvability of the Consensus and Broadcast problems in synchronous 
communication networks in which the delivery of messages is not reliable. The failure model is 
the mobile omission faults model. During each round, some messages can be lost and the set of 
possible simultaneous losses is the same for each round. We investigate these problems for the 
first time for arbitrary sets of possible failures. Previously, these sets were defined by bounding 
the numbers of failures. In this setting, we present a new necessary condition for the solvability 
of Consensus that unifies previous impossibility results in this area. This condition is expressed 
using Broadcastability properties. As a very important application, we show that when the sets 
of omissions that can occur are defined by bounding the numbers of failures, counted in any 
way (locally, globally, etc.), then the Consensus problem is actually equivalent to the Broadcast 
problem. 

1 Introduction 

We consider synchronous communication networks in which some messages can be lost during 
each round. These omission faults can be permanent or not; a faulty link can become reliable 
again after an unpredictable number of rounds, and it can continue to alternate between being 
reliable and faulty in an unpredictable way. This model is more general than other models, 
such as component failure models, in which failures, once they appear somewhere, are located 
there permanently. The model that we use, called the mobile faults or dynamic faults model, 
was introduced in [SW89] and is discussed further in [SW07] . An important property of these 
systems is that the set of possible simultaneous omissions is the same for each round. In some 
sense, the system has no "memory" of the previous failures. Real systems often exhibit such 
memory- less behaviour. 

In previous research, the sets of possible simultaneous omissions were defined by bounding 
the numbers of omissions. Recent work on this subject includes [SW07J, in which omissions 
are counted globally, and [SWK09] . in which the number of omissions is locally bounded. 
It has also been shown to be good for layered analysis [MR02]. In this paper we consider 
the most general case of such systems, i.e. systems in which the set of possible simultaneous 
omissions is arbitrary. This allows the modelling of any system in which omissions can happen 
transiently, in any arbitrary pattern, including systems in which the communications are not 
symmetric. 

We investigate two fundamental problems of Distributed Computing in these networks: the 
Consensus problem and the Broadcast problem. While it has long been known that solvability 
of the Broadcast problem implies solvability of the Consensus problem, we prove here that 
these problems are actually equivalent (from both the solvability and complexity points of 
view) when the sets of possible omissions are defined by bounding the number of failures, for 
any possible way of counting them (locally, globally, any combination, etc.). 
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1.1 The Consensus Problem 

The Consensus problem is a very well studied problem in the area of Distributed Algorithms. 
It is defined as follows. Each node of the network starts with an initial value, and all nodes 
of the network have to agree on a common value, which is one of the initial values. Many 
versions of the problem concern the design of algorithms for systems that are unreliable. 

The Consensus problem has been widely studied in the context of shared memory systems 
and message passing systems in which any node can communicate with any other node. 
Surprisingly, there have been few studies in the context of communication networks, where 
the communication graph is not a complete graph. In one of the first thorough studies S\\ ()7j. 
Santoro and Widmayer investigate some k— Majority Problems that are defined as follows. 
Each node starts with an initial value, and every node has to compute a final decided value 
such that there exists a value (from the set of initial values) that is decided by at least k of the 
nodes. The Consensus problem (called the Unanimity problem in |SW07 j^]) is the n— Majority 
problem where n is the number of nodes in the network. 

In their paper, Santoro and Widmayer give results about solving the Consensus problem in 
communication networks with various types of faults including omission faults. For simplicity, 
we focus here only on omission faults. We believe that our results can be quite easily extended 
to other fault models, using the methodology of |SW07| . 

1.2 The Broadcast Problem 

Two of the most widely studied patterns of information propagation in communication net- 
works are broadcasting and gossiping. A broadcast is the distribution of an initial value from 
one node of a network to every other node of the network. A gossip is a simultaneous broad- 
cast from every node of the network. The Broadcast problem that we study in this paper is 
to find a node from which a broadcast can be successfully completed. 

There are close relationships between broadcasting and gossiping, and the Consensus 
problem. Indeed, the Consensus problem can be solved by first gossiping and then applying 
a deterministic function at each node to the set of initial values. But a gossip is not actually 
necessary. If there exists a distinguished node vq in the network, then a Consensus algorithm 
can be easily derived from an algorithm that broadcasts from vq. However the Broadcast 
problem and the Consensus problem are not equivalent, as will be made clearer in Section |3j 

1.3 Our Contributions 

In this paper, we investigate systems in which the pattern of omission failures is arbitrary. A 
set of simultaneous omissions is called a communication event. We characterize the solvability 
of the Broadcast and Consensus problems subject to an arbitrary family of possible com- 
munication events. A node from which it is possible to broadcast if the system is restricted 
to a given communication event is called a source for the communication event. We prove 
that the Broadcast problem is solvable if and only if there exists a common source for all 
communication events. To study the Consensus problem, we define an equivalence relation 
on a family of communication events based on the collective local observations of the events 

1 Note that some of the terminology that we use in this paper is different from the terminology of Santoro and 
Widmayer. We are investigating relationships between different areas of distributed algorithms, and some 
terminology (such as k— agreement) has different meanings in the different areas. 
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by the sources. We prove in Theorem 4.10 that the Consensus problem is not solvable for a 
family of communication events if the Broadcast problem is not solvable for one class of the 
equivalence relation. For Consensus to be solvable, the sources of a given event must be able to 
collectively distinguish communication events with incompatible sources. We conjecture that 
this is actually a sufficient condition. 

It is very simple to characterize Broadcastability (see Theorem 4.4), so we get very simple 
and efficient impossibility proofs for solving Consensus subject to arbitrary omission failures. 
These impossibility conditions are satisfied by the omission schemes of [SW07J and [SWK09]. 
This means that our results encompass all previous known results in the area. 

Furthermore, we prove that under very general conditions, in particular when the pos- 
sible simultaneous omissions are defined by bounding the number of omissions, for any way 
of counting omissions, there is actually only one equivalence class when the system is not 
broadcastable. An important application is that, the Consensus problem is exactly the same 
as the Broadcast problem for most omission fault models. Therefore, it is possible to deduce 
complexity results for the Consensus problem from complexity results about broadcasting 
with omissions. 



1.4 Related Work 

In [CBS09J, the authors present a model that can describe benign faults. This model is called 
the "Heard-Of" model. It is a round-based model for an omission-prone environment in which 
the set of possible communication events is not necessarily the same for each round. However, 
they require a time-invariance property. As a special case, they present a characterization 
that shows that solving Consensus in this environment is equivalent to solving a Selection 
problem. They also present algorithms for some families of omission schemes. It is not possible 
to derive our simple characterizations from [CBS09J. 

In [SW07J, the Consensus and related Agreement problems are investigated for networks in 
which there are at most / omissions during any given round. It is proved that it is impossible 
to solve Consensus if / is at least the minimum degree of the graph. A Consensus algorithm 
is presented for the case where / is strictly smaller than the connectivity of the network. In 
[FG1T], a reduction to the two process case is used to show that the connectivity of the graph 
is indeed the exact limit for consensus is such omission schemes. In this paper, we generalize 
these results, showing that exact limits for Consensus can be derived from exact limits for 
Broadcast. 

While the above studies use a global failure metric, a local failure metric is investigated 
in [SWK09], distinguishing send and receive omissions. The authors describe which bounds 
allow Consensus to be solved, using a proof technique based on a Withholding Lemma. They 
claim that Consensus is solvable if and only if no node can withhold its information from some 
other part of the network. We will show that this is not true when the pattern of omissions can 



be arbitrary. We present (in Example 2.5) a system in which nodes can withhold information 
infinitely but Consensus is solvable. 

Finally, it is worth noting that, although both [SW07J and [SWK09^ are using the clas- 
sic bivalency proof technique, it is not possible to derive any of the results of fS\V()7| or 
[FG11J (global bound on omissions) from the results of [SWK09J (local bound) as the omis- 
sion schemes are not comparable. Our results consolidate these previous results. Furthermore, 
our approach is more general than these previous approaches and is more suitable for appli- 
cations to new omission metrics. 
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2 Definitions and Notation 



2.1 Communication Networks 

We model a communication network by a digraph G = (V, E) which does not have to be 
symmetric. If we are given an undirected graph G, we consider the corresponding symmetric 
digraph. We always assume that nodes have unique identities. Given a set of arcs E, we define 
h(E) = {t | (s, t) G E}, the set of nodes that are heads of arcs in E. 

All sub-digraphs that we consider in this paper are spanning subgraphs. Since all spanning 
subgraphs of a digraph have the same set of nodes, we will use the same notation to refer to 
both the set of arcs of a sub-digraph and the sub-digraph with that set of arcs when the set 
of nodes is not ambiguous. 

2.2 Omission Schemes 

In this section, we introduce our model and the associated notation. Communication in our 
model is synchronous but not reliable, and communication is performed in rounds. Commu- 
nication with omission faults is described by a spanning sub-graph of G with the semantics 
that are specified in Section |2.4| 

Throughout this paper, the underlying graph G = (V, E) is fixed, and we define the set 
E = {(V,E') | E' C E}. This set represents all possible simultaneous communications given 
the underlying graph G. 

Definition 2.1. An element of E is called a communication event (or event for short). An 
omission scenario ( or scenario for short) is an infinite sequence of communication events. 
An omission scheme over G is a set of omission scenarios. 

A natural way to describe communications is to consider E to be an alphabet, with 
communication events as letters of the alphabet, and scenarios as infinite words. We will use 
standard concatenation notation when describing sequences. If w and w' are two sequences, 
then ww' is the sequence that starts with the ordered sequence of events w followed by the 
ordered sequence of events w' . This notation is extended to sets in an obvious way. The empty 
word is denoted s. We will use the following standard notation to describe our communication 
schemes. 

Definition 2.2 ([PP04]). Given R C E, R* is the set of all finite sequences of elements of 
R, and R u is the set of all infinite ones. 

The set of all possible scenarios on G is then E w . A given word w G E* is called a partial 
scenario and \w\ is the length of this partial scenario. An omission scheme is then a subset 
S of E u . A mobile omission scheme is a scheme that is equal to R u for some subset R C E. 
In this paper, we consider only mobile omission schemes. Note that we do not require G 
to belong to R. A formal definition of an execution subject to a scenario will be given in 
Section |2.4| Intuitively, the r-th letter of a scenario will describe which communications are 
reliable during round r. 

Finally, we recall some standard definitions for infinite words and languages over an al- 
phabet E. Given w = [a\, 02, . . .) G E w , a subword of w is a (possibly infinite) sub-sequence 
(flWl)) Ocr(i)j • • •)> where a is a strictly increasing function. A word u £ E* is a prefix of w G E* 
(resp. w' G E u ) if there exists v G E* (resp. v' G E u ) such that w = uv (resp. w' = uv'). 
Given w G E u and r G N, w\ r is the finite prefix of w of length r. 
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Definition 2.3. Let w G E w and L C U u . Then Pref(w) = {u £ U*\u is a prefix of w}, and 

Pref(L) = |J Pref(w). A word w' is an extension of w in L, if ww' G L. 

weL 

2.3 Examples 

We do not restrict our study to regular sets, however all omission schemes known to us are 
regular, including the following examples, so we will use the notation for regular sets. We 
present examples for systems with two processes but they can be easily extended to any 
arbitrary graph. The set £ = {o^>», o<— o— o •} is the set of directed graphs with two 
nodes o and •. The subgraphs in U describe what can happen during a given round with the 
following interpretation: 

— o±5« : all messages that are sent are correctly received; 

— o<— • : the message from process o, if any, is not received; 

— o— : the message from process •, if any, is not received; 

— o • : no messages are received. 

Example 2.4- The set {ot^»} w corresponds to a reliable system. The set 0\ = {o±^», cx— •, o— >•} 
is well studied and corresponds to the situation in which there is at most one omission per 
round. 

Example 2.5. The set H = {o-<— •, o— describes a system in which at most one message 
can be successfully received in any round, and if only one message is sent, it might not be 
received. 

The examples above are examples of mobile omission schemes. The following is a typical 
example of a non-mobile omission scheme. 

Example 2.6. Consider a system in which at most one of the processes can crash. From the 
communications point of view, this is equivalent to a system in which it is possible that no 
messages are transmitted by one of the processes after some arbitrary round. The associated 
omission scheme is the following: 

Ci = {o^. w } U {o^.}*({o^} U {o-* w }). 

2.4 Reliable Execution of a Distributed Algorithm Subject to Omissions 

Given an omission scheme S, we define what is a successful execution of a given algorithm A 
with a given initial configuration i. Every process can execute the following communication 
primitives: 

— send(v,msg) to send a message msg to an out-neighbour v, 

— recv(v) to receive a message from an in-neighbour v. 

An execution, or run, of an algorithm A subject to scenario w £ S is the following. 
Consider process u and one of its out-neighbours v. During round r € N, a message msg is 
sent from u to v, according to algorithm A. The corresponding recv(v) will return msg only 
if E' , the r-th letter of w, is such that (u,v) G E' . Otherwise the returned value is null. All 
messages sent in a round can only be received in the same round. After sending and receiving 
messages, all processes update their states according to A and the messages they received. 
Given u G Pref{w), let s x {u) denote the state of process x at the end of the |it|-th round 



5 



of algorithm A subject to scenario w. The initial state of x is l(x) = s x (e). A configuration 
corresponds to the collection of local states at the end of a given round. An execution of A 
subject to w is the (possibly infinite) sequence of such message exchanges and corresponding 
configurations. 

Remark 2. 7. With this definition of execution, the environment is independent of the actual 
behaviour of the algorithm, so communication failures do not depend upon whether or not 
messages are sent. This model is not suitable for modelling omissions caused by congestion. 
Sec [KKR03J for examples of threshold-based omission models. 

Definition 2.8. A algorithm A solves a problem V subject to omission scheme S with initial 
configuration i, if, for any scenario w G S, there exists u G Pref{w) such that the state s x (u) 
of each process x G V satisfies the specifications of V for initial configuration l. In such a 
case, A is said to be 5-reliable for V. 

Definition 2.9. If there exists an algorithm that solves a problem V subject to omission 
scheme S, then we say that V is S— solvable. 

Remark 2.10. We emphasize that for an algorithm, "knowing" the omission scheme against 
which it runs is not the same as knowing whether or not a given message is actually received. 

3 The Problems 

3.1 The Binary Consensus Problem 

A set of synchronous processes wishes to agree about a binary value. This problem was first 
identified and formalized by Lamport, Shostak and Pease [PSL80]. Given a set of processes, 
a consensus protocol must satisfy the following properties for any combination of initial val- 
ues |Lyn 96|: 

— Termination: every process decides some value; 

— Validity: if all processes initially propose the same value v, then every process decides v; 

— Agreement: if a process decides v, then every process decides v. 

Consensus with these termination and decision requirements is more precisely referred to 
as Uniform Consensus (see [Ra y02| for a discussion). Given a fault environment, the natural 
questions are: is Consensus solvable, and if it is solvable, what is the minimum number of 
rounds to solve it? 

3.2 The Broadcast Problem 

Let G = (V, E) be a graph. There is a broadcast algorithm from u G V, if there exists an 
algorithm that can successfully transmit any value stored in u to all nodes of G. 

The Broadcast problem on graph G is to find a u G V and an algorithm A such that A 
is a broadcast algorithm from u. Given an omission scheme S on G, G is 5-broadcastable if 
there exists a u G V such that there is an 5-reliable broadcast algorithm A from u. 
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3.3 First Reduction 



The next proposition is quite well known but leads to very interesting questions. 

Proposition 3.1. Let G be a graph andS an omission scheme for G . IfG is S -broadcastable, 
then Consensus is S-solvable on G. 

Proof. If G is 5-broadcastable then there exists a node u and an algorithm A for broadcasting 
any value from u subject to S. The consensus algorithm uses A to broadcast the initial value 
of u, and then every node decides the value received from u. As the value that u broadcasts is 
one of the initial values, the algorithm satisfies all three conditions, and is obviously 5-reliable. 
□ 

We now present an example that shows that the converse is not always true. 



Example 3.2. The omission scheme % = {°^— of Example 2.5 is an example of a 
system for which there is a Consensus algorithm but no Broadcast algorithm. 

It is easy to see that it is not possible to broadcast from o (resp. •) subject to H because 
cK— « w (resp. o— )-« aJ ) is a possible scenario. However, the following one-round algorithm (the 
same for both processes) is an T~L— reliable Consensus algorithm: 

— send the initial value; 

— if a value is received, decide this value, otherwise decide the initial value. 

This algorithm is correct, as exactly one process will receive a value, but it is not possible to 
know in advance whose value will be received. 

We propose to study the following question: when is the solvability of Consensus equivalent 
to the solvability of Broadcast? That is, given a graph G, what are the mobile omission 
schemes S on G such that Consensus is 5-solvable and G is 5-broadcastable. In the process 
of answering this question, we will give a simple characterization of the solvability of Broadcast 
and a necessary condition for the solvability of Consensus subject to mobile omission schemes. 



4 Broadcastability 

4.1 Flooding Algorithms 

We start with a basic definition and lemma. 

Definition 4.1. Consider a sub-digraph H ofG and a node u G V . A node v G V is reachable 
from u in H if there is a directed path from u to v in H . Node u is a source for H if every 
v £ V is reachable from u in hi . 

In a flooding algorithm, one node repeatedly sends a message to its neighbours, and each 
other node repeatedly forwards any message that it receives to its neighbours. The following 
useful lemma (from folklore) about synchronous flooding algorithms is easily extended to the 
omission context. Let T r u denote a flooding algorithm that is originated by u G V and that 
halts after r synchronous rounds. 

Lemma 4.2. A node u G V is a source for H if and only if for allr > \ V\, T r u is '—reliable 
for the Broadcast problem. 
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4.2 Characterizations of Broadcastability with Arbitrary Omissions 



We have the following obvious but fundamental lemma. We say that a node is informed if it 
has received the value from the originator of a broadcast. 

Lemma 4.3. Let u £ V, r £ N, and let Inform(w) be the set of nodes informed by J 7 ^ 
under the partial execution subject to w £ U* . Then for any subword w' of w, Inform(w') C 
Inform(w). 

Theorem 4.4. Let G be a graph and R a set of communication events for G. Then G is 
R? —broadcastable if and only if there exists u £ V that is a source for all H £ R. 

Proof. In the first direction, suppose that we have a broadcast algorithm from a given u that 
is R u — reliable. Then an execution subject to H u is successful for any H £ R, so u is a source 



for H by Lemma 4.2 



In the other direction, choose the flooding algorithm J-^ x to be the broadcast algorithm 
and consider a scenario w £ B)^*\ v \. There is an event H £ R that a ppea rs at least \V\ times 



m w, hence H\ v \ is a subword of w. As u is a source for H by Lemma |4.2[ Inform(H^ v ^) = V. 



By Lemma 4.3, Lnform{w) = V, and the flooding algorithm is i?^— reliable. □ 
Definition 4.5. The set of sources of an event H £ U is B(H) = {u £ V \ u is a source for H}. 

Definition 4.6. Let H±, . . . , H q £ U. Then the set {H±, . . . , H q } is source-incompatible if 

1. VI <i<q,B{Hi)^%, 

2. B{Hi) = 0. 

l<i<g 



With these definitions we can restate the Broadcastability theorem (Theorem 4.4): 

Theorem 4.7. Let G be a graph and R a set of communication events for G. Then G 
is R u — broadcastable if and only if every event in R has a source, and R is not source- 
incompatible. 



4.3 A Converse Reduction 

Given a subset X C V of vertices, and H an event we denote Lnx(H) = {(v,u) £ H \ u £ X}. 
Given R an omission scheme. We now define a more precise relation about indistinguishability. 

Definition 4.8. Given three directed graphs G,H,K £ R,we define the following relation 
denoted by GaxH if Ln^^(G) = In B ^(H). The relation a* is the transitive closure of ax 
relations for any K £ R. 

We denote (3 the coarsest equivalence relation included in a* such that for all graphs G, H 
(Closure Property) G(3H ==> 3Hq, . ■ ■ , H q and K\, . . . , K q such that 

- G = Hq, H = H q , 

- Vi > l,HiPG,KiPG, 

- Mi > 0,Hia Kl H i+1 . 

The relation ax describes how some communication events are indistinguishable to the 
all the nodes of B(K). The relation f3 is well defined as the equality relation satisfies such 
a closure property. And for any two relations R\ and R2 that satisfy the property, we have 
Ri U i?2 that satisfies the Closure. 
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Example 4-9. In 0\ from Example |2.4[ there is only one equivalence class. Let's see why. 
First, the sets of sources to consider are: 

- = {•}; 

- = W; 

- S(o±5.) = {o,.}. 

We have o<— • a/ i ot^» and o— )>• a{.} ot^». Therefore, all communication events are /3— equi- 



valent. In Example 2.5, /3 has two equivalence classes, and every node can distinguish imme- 



diately which communication event happened. 

As will be seen later in Section [6j the omission schemes in |SW07| and [SWK09] , and more 
generally, all schemes that are defined by bounding the number of omissions in some way, 
have only one /3— class when they are source-incompatible. Finally, we can now state the main 
theorem. 

Theorem 4.10. Let G a graph and R a set of communication events for G. If Consensus is 
R* 3 — solvable then for every (3r— class C, G is C w —broadcastable. 

5 Proof of Main Theorem 

5.1 Events without Sources 

First we consider the cases in which there are events without sources. 

Proposition 5.1. If there is an H £ R that has no source, then Consensus is not R u —solvable. 

Proof. If H has no source, then there exist two non-overlapping, non-empty subsets of nodes 
Uq and U\ such that there are no paths in H from V\U{ to Ui, i = 0,1. 
We consider the three following initial configurations: 

1. Iq: initial value is at every node, 

2. I\: initial value is 1 at every node, 

3. /: initial value is if and only if the node belongs to Uq. 

Under scenario H u , I is not distinguishable from Iq (resp. I{) for Uq (resp. U\). So subject 
to H^, the algorithm decides in Uq and 1 in U\, and this contradicts the Agreement property. 
□ 

The proof of the main theorem uses an approach that is similar to the adjacency and 
continuity techniques of [SW07]. So, we will first prove these two properties. What should 
be noted is that the adjacency and continuity properties are mainly consequences of the fact 
that the scheme is a mobile scheme. 

5.2 An Adjacency Property 

Lemma 5.2. Let H be a subgraph of G, and let (s,t) G H . If t G B{H) then s G B(H). 

Proposition 5.3. Let H G R and w,w' G R* such that \w\ = \w'\ and s p (w) = s p (w') for all 
p G B(H). Then for all k G N and all p G B(H), s p {wH k ) = s p {w'H k ). 



Proof. The proof relies upon Lemma 5.2 which implies that processes from B(H) can only 



receive information from B(H) under scenario H , for any k G N. □ 
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5.3 A Continuity Property 



Lemma 5.4. Let H, H' G R such that Hoib^H' for some A G R. Then for all w G i? w and 
all p G s^ioif) = s p (wH'). 

Proof. By definition of a[/ relations, processes in B(A) cannot distinguish H from H' meaning 
they are receiving the exact same messages from exactly the same nodes in both scenarios. 
Hence they end in the same states. □ 

Proposition 5.5. Let H,H' G R such that H(3H'. Then for every w G R* , there exist 
Hi, . . . , H g in the j3— class of H and H' , and Aq, . . . , A q G R, such that for every < i < q 
and every p G B{Ai), s p {wHi) = s p (wHi+\), where Hq = H and H q +\ = H' . 

Proof By Lemma |5.4| and definition of j3. □ 



5.4 End of Proof of Theorem liTTOl 

We will use a standard bivalency technique. We suppose that we have an algorithm that solves 
Consensus. A configuration is said to be 0— valent (resp. 1— valent) if all extensions decide 
(resp. 1). A configuration is said to be bivalent subject to L if there exists an extension in L 
that decides and another extension in L that decides 1. 

Lemma 5.6 (Restricted Initial Bivalent Configuration). If there exists a source-incompatible 
set D, then there exists an initial configuration that is bivalent subject to D u . 

Proof Suppose that {Hi,...,H q } is a source-incompatible set in D. There exist disjoint 
non-empty sets of nodes M\, . . . , Mk such that Vz, 3/ C [1, k], B(Hi) = {J Mj. 

Consider lq (resp. in which all nodes of (J Mj have initial value (resp. 1). The 

initial configuration lq is indistinguishable from the configuration in which all nodes have 
initial value for the nodes of B(Hi) under scenario Hf for every i. Hence lq is 0— valent. 
Similarly if. is 1— valent. We consider now the initial configurations ii, 1 < I < k — 1 in which 
all nodes from (J Mj have initial value 0, and all other nodes have initial value 1. 

l<J<*-2 

Suppose now that all initial configurations are univalent. Then there exists 1 < I < k such 
that is 0— valent and i\ is 1— valent. As the set is source-incompatible, there must exist 



i G [1, q] such that Mi (1 B{Hi) = 0. So, we can apply Proposition 5.3 to Hi. This means that 
all nodes in B(Hi) decide the same value for both initial configurations, Li—\ and n, under 
scenario Hf , and this is a contradiction. □ 

Lemma 5.7 (Restricted Extension). Let C be a f3— class. Every bivalent configuration in 
C u has a succeeding bivalent configuration in C u . 

Proof. Consider a bivalent configuration obtained after a partial execution subject to w £ C* . 
By way of contradiction, suppose that all succeeding configurations in C u are univalent. 
Then there exist succeeding configurations wH and wH' that are respectively 0— valent and 
1— valent, as w is bivalent. 



By Proposition 5.5, there exist H±, . . . , H q in C and Aq, . . . ,A q G R such that s p (wHi) = 
s p {wHi + \) for every < i < q and every p G B{Ai), where Hq = H and H q+ \ = H' . 
By hypothesis, all succeeding configurations wHi are univalent. As Ha B ^ Q \H\, we get that 
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processes in B(Aq) are in the same state after H and after H±. Hence, by Proposition 5.3 
they are also in the same state after HAq and after HiAq, so they decide the same value 
and wHi is 0— valent. We can repeat this for any 1 < i < q. Hence wH' is also 0— valent, a 
contradiction. □ 

We can now finish the proof with the standard bivalency arguments. Suppose that we 
have a source-incompatible set in the same f3— class C. Also suppose that there exists an 



R^— reliable Consensus algorithm for G. By Lemma 5.6, there exists an initial configuration 



that is bivalent in C u . From Lemma 5.7 we deduce that the algorithm does not satisfy 



the Termination property for Consensus on some execution subject to C w C R u , which is a 



contradiction. Using Proposition 5.1 and Theorem 4.7 we conclude the proof of Theorem 4.10 



6 Solvability of Consensus vs Broadcast 

In this section, we prove that the Consensus and Broadcast problems are equivalent for the 
large family of omission schemes that are defined over convex sets of events. This has very 



important consequences as checking Broadcastability is quite simple (see Theorem 4.4) 



Definition 6.1. A set of communication events R is convex if, for every H,H' G R and 
every a G H', H U {a} G R. 

Basically, this definition says that a convex set of communication events R is closed under 
the operation of adding a reliable communication event a from one event H' to another event 
H. This is an important subfamily because sets of events that are defined by bounding the 
number of omissions, for any way of counting them, are convex. Stated differently, adding 
links to an event H with a bounded number of omissions cannot result in an event with more 
omissions. The convexity property does not depend upon the way that omissions are counted. 

Theorem 6.2. Let R C U be a convex set of communication events over a graph G. Then 
Consensus is R u —solvable if and only if G is R u —Broadcastable. 



Proof. By Theorem 4.10 we only have to show that there is no source- incompatible set in R. 
We will show that if there is such a set {H±, . . . , H q }, then there is only one /3r class. 

There exist disjoint, non-empty sets of nodes Mi, . . . , M& such that Vi, 31 C [1, k], B(H.{) = 

U Mj. The Mj are "generators" for the sets of sources. We use Mj to denote (J Mj for any 

je/ jeJ 



J C [1, k]. Note that, as the intersection of the Hi is empty (Definition 4.6 ), for each j G [1, k], 
there exists ij such that Mj n B{Hi 3 ) = 0. 

Now, consider H\ ^ H2 £ R- We will show that Hi (3 (Hi VJH^). Using the decomposition 
into MjS, there exist three mutually disjoint (possibly empty) subsets Ji, J, J2 of [1, k], such 
that B(Hi) = Mj lUj and B{H 2 ) = Mj 2Uj . 

Let H[ = H\ U {(s,t) £ H 2 \t G MjJ. As the intersection of B(H 2 ) with M Jx is empty, 
we have Hia B ^ H2 ^H[. 

Similarly, letting H'{ = H [ U {(s, t) G Hi \ t G Mj 2 }, we have H[a B{Hl) H". 

To obtain Hi U H 2 , we need to add to H'{ the arcs with heads in B(H{) n B(H 2 ) = Mj. 

Let J = {ji, j q } and K k = H'{ [j M jt . 

l<k 

For each I G J, there exists %\ such that MiC\B(Hi t ) = 0. Therefore, for all k, Kk-iOiB^jj. \K}.. 
So finally, we obtain H x (3 (Hi U H 2 ). Similarly, H 2 (3 (Hi U H 2 ), and Hi(3H 2 . □ 



11 



Let Of(G) denote the set of communication events with at most / omissions from the 
underlying graph G. An upper bound on / for solvability of Consensus subject to Of(G) UJ 
was given in |SW07| . and it was proved to be tight with an ad hoc technique in |FGllj . We 
can now state this result as an immediate corollary of Theorem |6.2| 

Corollary 6.3. Let f G N. Consensus is solvable subject to Of{G) u if and only if f < c(G), 
where c(G) is the connectivity of the graph G. 

The equivalence of Consensus and Broadcast includes the number of rounds to solve them. 

Proposition 6.4. Let R C £ be a convex set of events. If Consensus is R? — solvable then it 
is solvable with exactly the same number of rounds as Broadcast subject to R u . 

Proof. We only have to show that Consensus cannot be solved in fewer rounds than Broadcast. 
Due to space limitations, we only present a sketch of the proof. We use the same bivalency 
technique as in Section [5] So, suppose that Consensus is solvable in r c rounds while Broadcast 
needs more than r c rounds for any originator. 

First, we show that there must be a bivalent initial configuration. Let V = {vi, . . . ,v n } 
and let n be the initial configuration in which Vi has initial value if i < I. If all initial 
configurations n, 1 < I < n are univalent, then there exists / such that is 0— valent and n 
is 1— valent. As a Broadcast from vi needs strictly more than r c rounds, there exists a vertex 
v that does not receive the value from vi, so no executions of length r c of the Consensus 
algorithm from initial configurations and q can be distinguished by v. Consequently v 
will decide the same value for both initial configurations, a contradiction. 

Now we show that if all extensions of a bivalent configuration are univalent, then the 
Consensus algorithm needs more than r c rounds to conclude. Indeed, if we have an extension, 
starting with communication event Hq, that is 0— valent, and another extension, starting with 
communication event Hi, that is 1— valent, we can repeat the above technique by adding arcs 
to Hq to obtain Hq U H\. The addition of arcs can be done by grouping them according to 
their heads. If only one node has a different state for two events, then it would need more 
than r c rounds to inform all other nodes. □ 

There are many results concerning the Broadcast problem in special families of networks. 
General graphs are studied in [CDP94j and hypergraphs are studied in [MV98J. An optimal 
algorithm for the family of hypercubes in given in [DV99]. For a hypercube of dimension 
n, if at most n — 1 messages are lost during each round, then Broadcast can be solved in 
n + 2 rounds, compared to n rounds when there are no omission faults. In [DV04] . the precise 
impact on Broadcast of the actual number of faults is given. Based on the results in |DV04] . 
we get the following bounds for Consensus. 

Corollary 6.5. In hypercubes of dimension n, if the global number of omissions is at most 
f per round, then 

1- if f > n , then Consensus is not solvable, 

2. if f = n — 1, Consensus is solvable in exactly n + 2 rounds, 

3. if f = n — 2, Consensus is solvable in exactly n + 1 rounds, 
4- if f < n — 2, Consensus is solvable in exactly n rounds. 

The following example shows that there are mobile schemes that are broadcastable but 
for which Consensus is solvable is fewer rounds than Broadcast. 
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Fig. 1. Event H x . 



Fig. 2. Event H 2 . 



Example 6.6. Let R = {H\,H2} where H\ (resp. H2) is given by Fig. 1 (resp. Fig. 2). One 
can see that d needs two rounds to broadcast in H\, and c needs two rounds to broadcast in 
H 2 . Nodes a and b need more than two rounds. However there is a Consensus algorithm that 
finishes in one round. Notice that every node can detect which of the communication events 
actually happened, so the Consensus algorithm in which every node decides the value from c 
if Hi happened and the value from d if H 2 happened uses only one round. 



7 Conclusions 

We have presented a new necessary condition for solving Consensus on communication net- 
works subject to arbitrary mobile omission faults. We conjecture that this condition is actually 
sufficient, therefore leading to a complete characterization of the solvability of Consensus in 
environments with arbitrary mobile omissions. For a large class of environments that includes 
any environment defined by bounding the number of omissions during any round, for any 
way of counting omissions, we proved that the Consensus problem is actually equivalent to 



the Broadcast problem. We also gave examples (Ex. |2.5| and Ex. 6.6) showing how Consensus 
can differ from Broadcast for some environments. 

Finally, by factoring out the broadcastability properties required to solve Consensus, we 
think that it is possible to extend this work to other kinds of failures, such as byzantine 
communication faults. 
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